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This Raw Listing contains the General 
Information Section and those Sequences 
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(1) 



SEQUENCE LISTING 

General Information: 
i) APPLICANT: Chiron Corporation 



Correct ■ ^ ' ,y 



(ii) TITLE OF INVENTION: Methods for Administration of 

Recombinant Gene Delivery Vehicles for Treatment of Hemophilia and Other Dis 



(iii) NUMBER OF SEQUENCES: (&3 j /L^tl*-'^ (^ /^) 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Chiron Corporation 

(B) STREET: 4560 Horton Street 

(C) CITY: Emeryville 

(D) STATE: California 

(E) COUNTRY: U.S.A. 

(F) ZIP: 94608 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kruse, Norman J. 

(B) REGISTRATION NUMBER: 35,235 

(C) REFERENCE/DOCKET NUMBER: 1155.005 



As 



I- 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (510) 923-3520 

(B) TELEFAX: (510) 655-3542 



ERRORED SEQUENCES FOLLOW: 



~JteZ ( Pel) , 

~ /tt <£ 
^?L^c^ f <a s^U fro 




PAGE: 2 "~RAW SEQUENCE LISTING DATE: 02/10/98 

PATENT APPLICATION US/09/001,039 TIME: 1 1 :36:47 

INPUT SET: S23358.raw 

10 3 (2) INFORMATION FOR SEQ ID NO: 7: 

104 (i) SEQUENCE CHARACTERISTICS : 

— > 105 (A) LENGTH: 77 base pairs 

106 (B) TYPE: nucleic acid 

107 (C) STRANDEDNESS: single /Xl>'r-*A, £ - r i L^, 

108 (D) TOPOLOGY: linear 

109 (ii) MOLECULE TYPE: DNA (genomic) 

110 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

111 AGTGAATTCG AGCTCGGTAC CCGGGGATCC TCTAGAGTCG ACCTGCAGGC 
— > 112 ATGCAAGCTT 60 — 

113 GGCGTAATCA TGGTCAT 7 7 

114 
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(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 : 

GCGCCAGTCC TCCGATTGAC TGAGTCGCCC GGGTACCCGT GTATCCAATA 
AACCCTCTTG 60 

CAGTTGCATC CGACTTGTGG TCTCGCTGTT CCTTGGGAGG GTCTCCTCTG 
AGTGATTGAC 120 

TACCCGTCAG CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCGGGAGA 
CCCCTGCCCA 180 

GGGACCACCG ACCCACCACC GGGAGGTAAG CTGGCCAGCA ACTTATCTGT 
GTCTGTCCGA 240 

TTGTCTAGTG TCTATGACTG ATTTTATGCG CCTGCGTCGG TACTAGTTAG 
CTAACTAGCT 300 

CTGTATCTGG CGGACCCGTG GTGGAACTGA CGAGTTCGGA ACACCCGGCC 
GCAACCCTGG 360 

GAGACGTCCC AGGGACTTCG GGGGCCGTTT TTGTGGCCCG ACCTGAGTCC 
AAAAATCCCG 420 

ATCGTTTTGG ACTCTTTGGT GCACCCCCCT TAGAGGAGGG ATATGTGGTT 
CTGGTAGGAG 480 

ACGAGAACCT AAAACAGTTC CCGCCTCCGT CTGAATTTTT GCTTTCGGTT 
TGGGACCGAA 540 

GCCGCGCCGC GCGTCTTGTC TGCTGCAGCA TCGTTCTGTG TTGTCTCTGT 
CTGACTGTGT 600 

TTCTGTATTT GTCTGAGAAT ATGGGCCAGA CTGTTACCAC TCCCTTAAGT 




Explanato££t error that occurred throughout the Sequence lifting: 

For or your nucleic sequences, the nucleic number at the end of each line 

"wrapped" down to the next line. This occurred if your file was retrieved in a word ^ 
processor after creating it in Patent In. Your word processor probably has different ^ 
margin settings than those used in Patentln. (A right margin set at least to .3 in ^ 
your word processor will prevent wrapping) 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/001,039 



DATE: 02/10/98 
TIME: 11:36:49 



INPUT SET: S23358.raw 



TTGACCTTAG 



660 



GTCACTGGAA AGATGTCGAG CGGATCGCTC ACAACCAGTC GGTAGATGTC 
AAGAAGAGAC 720 

GTTGGGTTAC CTTCTGCTCT GCAGAATGGC CAACCTTTAA CGTCGGATGG 
CCGCGAGACG 780 

GCACCTTTAA CCGAGACCTC ATCACCCAGG TTAAGATCAA GGTCTTTTCA 
CCTGGCCCGC 840 

ATGGACACCC AGACCAGGTC CCCTACATCG TGACCTGGGA AGCCTTGGCT 
TTTGACCCCC 900 

CTCCCTGGGT CAAGCCCTTT GTACACCCTA AGCCTCCGCC TCCTCTTCCT 
CCATCCGCCC 960 

CGTCTCTCCC CCTTGAACCT CCTCGTTCGA CCCCGCCTCG ATCCTCCCTT 
TATCCAGCCC 1020 

TCACTCCTTC TCTAGGCGCC AAACCTAAAC CTCAAGTTCT TTCTGACAGT 
GGGGGGCCGC 1080 

TCATCGACCT ACTTACAGAA GACCCCCCGC CTTATAGGGA CCCAAGACCA 
CCCCCTTCCG 1140 

ACAGGGACGG AAATGGTGGA GAAGCGACCC CTGCGGGAGA GGCACCGGAC 
CCCTCCCCAA 1200 

TGGCATCTCG CCTACGTGGG AGACGGGAGC CCCCTGTGGC CGACTCCACT 
ACCTCGCAGG 1260 

CATTCCCCCT CCGCGCAGGA GGAAACGGAC AGCTTCAATA CTGGCCGTTC 
TCCTCTTCTG 1320 

ACCTTTACAA CTGGAAAAAT AATAACCCTT CTTTTTCTGA AGATCCAGGT 
AAACTGACAG 1380 

CTCTGATCGA GTCTGTTCTC ATCACCCATC AGCCCACCTG GGACGACTGT 
CAGCAGCTGT 1440 

TGGGGACTCT GCTGACCGGA GAAGAAAAAC AACGGGTGCT CTTAGAGGCT 
AGAAAGGCGG 1500 

TGCGGGGCGA TGATGGGCGC CCCACTCAAC TGCCCAATGA AGTCGATGCC 
GCTTTTCCCC 1560 

TCGAGCGCCC AGACTGGGAT TACACCACCC AGGCAGGTAG GAACCACCTA 
GTCCACTATC 1620 

GCCAGTTGCT CCTAGCGGGT CTCCAAAACG CGGGCAGAAG CCCCACCAAT 
TTGGCCAAGG 1680 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/001,039 



DATE: 02/10/98 
TIME: 1 1:36:52 



INPUT SET: S23358.raw 



TAAAAGGAAT AACACAAGGG CCCAATGAGT CTCCCTCGGC CTTCCTAGAG 
AGACTTAAGG 1740 

AAGCCTATCG CAGGTACACT CCTTATGACC CTGAGGACCC AGGGCAAGAA 
ACTAATGTGT 1800 

CTATGTCTTT CATTTGGCAG TCTGCCCCAG ACATTGGGAG AAAGTTAGAG 
AGGTTAGAAG 1860 

ATTTAAAAAA CAAGACGCTT GGAGATTTGG TTAGAGAGGC AGAAAAGATC 
TTTAATAAAC 1920 

GAGAAACCCC GGAAGAAAGA GAGGAACGTA TCAGGAGAGA AACAGAGGAA 
AAAGAAGAAC 1980 

GCCGTAGGAC AGAGGATGAG CAGAAAGAGA AAGAAAGAGA TCGTAGGAGA 
CATAGAGAGA 2040 

TGAGCAAGCT ATTGGCCACT GTCGTTAGTG GACAGAAACA GGATAGACAG 
GGAGGAGAAC 2100 

GAAGGAGGTC CCAACTCGAT CGCGACCAGT GTGCCTACTG CAAAGAAAAG 
GGGCACTGGG 2160 

CTAAAGATTG TCCCAAGAAA CCACGAGGAC CTCGGGGACC AAGACCCCAG 
ACCTCCCTCC 2 2 20 

TGACCCTAGA TGACTAGGGA GGTCAGGGTC AGGAGCCCCC CCCTGAACCC 
AGGATAACCC 2280 

TCAAAGTCGG GGGGCAACCC GTCACCTTCC TGGTAGATAC TGGGGCCCAA 
CACTCCGTGC 2340 

TGACCCAAAA TCCTGGACCC CTAAGTGATA AGTCTGCCTG GGTCCAAGGG 
GCTACTGGAG 2400 

GAAAGCGGTA TCGCTGGACC ACGGATCGCA AAGTACATCT AGCTACCGGT 
AAGGTCACCC 24 60 

ACTCTTTCCT CCATGTACCA GACTGTCCCT ATCCTCTGTT AGGAAGAGAT 
TTGCTGACTA 2 520 

AACTAAAAGC CCAAATCCAC TTTGAGGGAT CAGGAGCTCA GGTTATGGGA 
CCAATGGGGC 2 580 

AGCCCCTGCA AGTGTTGACC CTAAATATAG AAGATGAGCA TCGGCTACAT 
GAGACCTCAA 2 640 

AAGAGCCAGA TGTTTCTCTA GGGTCCACAT GGCTGTCTGA TTTTCCTCAG 
GCCTGGGCGG 2 700 

AAACCGGGGG CATGGGACTG GCAGTTCGCC AAGCTCCTCT GATCATACCT 
CTGAAAGCAA 2 760 



/ 



/ 1 / 
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PAGE: 5 RAW SEQUENCE LISTING DATE: 02/10/98 

PATENT APPLICATION US/09/001,039 TIME: 11:36:54 

INPUT SET: S23358.raw 

334 

3 35 CCTCTACCCC CGTGTCCATA AAACAATACC CCATGTCACA AGAAGCCAGA 

— > 336 CTGGGGATCA 2820 

337 

3 38 AGCCCCACAT ACAGAGACTG TTGGACCAGG GAATACTGGT ACCCTGCCAG 

— > 339 TCCCCCTGGA 2880 

340 

341 ACACGCCCCT GCTACCCGTT AAGAAACCAG GGACTAATGA TTATAGGCCT 

— > 34 2 GTCCAGGATC 2940 

343 

344 TGAGAGAAGT CAACAAGCGG GTGGAAGACA TCCACCCCAC CGTGCCCAAC 

— > 345 CCTTACAACC 3000 

346 

34 7 TCTTGAGCGG GCTCCCACCG TCCCACCAGT GGTACACTGT GCTTGATTTA 

— > 348 AAGGATGCCT 3060 

349 

35 0 TTTTCTGCCT GAGACTCCAC CCCACCAGTC AGCCTCTCTT CGCCTTTGAG < 

— > 351 TGG AG AG ATC 3120 \ i , j 

352 

35 3 CAGAGATGGG AATCTCAGGA CAATTGACCT GGACCAGACT CCCACAGGGT / ~~ 

— > 354 TTCAAAAACA 3180 

355 

356 GTCCCACCCT GTTTGATGAG GCACTGCACA GAGACCTAGC AGACTTCCGG 

— > 357 ATCCAGCACC 3240 

358 

35 9 CAGACTTGAT CCTGCTACAG TACGTGGATG ACTTACTGCT GGCCGCCACT 

— > 360 TCTGAGCTAG 3300 

361 

36 2 ACTGCCAACA AGGTACTCGG GCCCTGTTAC AAACCCTAGG GAACCTCGGG 

— > 363 TATCGGGCCT 3360 

364 

365 CGGCCAAGAA AGCCCAAATT TGCCAGAAAC AGGTCAAGTA TCTGGGGTAT 

— > 366 CTTCTAAAAG 3420 

367 

368 AGGGTCAGAG ATGGCTGACT GAGGCCAGAA AAGAGACTGT GATGGGGCAG 

— > 369 CCTACTCCGA 3480 

370 

371 AGACCCCTCG ACAACTAAGG GAGTTCCTAG GGACGGCAGG CTTCTGTCGC 

— > 372 CTCTGGATCC 3540 

373 

374 CTGGGTTTGC AGAAATGGCA GCCCCCTTGT ACCCTCTCAC CAAAACGGGG 

— > 375 ACTCTGTTTA 3600 

376 

3 77 ATTGGGGCCC AGACCAACAA AAGGCCTATC AAGAAATCAA GCAAGCTCTT 

— > 378 CTAACTGCCC 3660 

379 

380 CAGCCCTGGG GTTGCCAGAT TTGACTAAGC CCTTTGAACT CTTTGTCGAC 

— > 381 GAGAAGCAGG 3720 

382 

3 83 GCTACGCCAA AGGTGTCCTA ACGCAAAAAC TGGGACCTTG GCGTCGGCCG 

— > 384 GTGGCCTACC 3780 

385 

386 TGTCCAAAAA GCTAGACCCA GTAGCAGCTG GGTGGCCCCC TTGCCTACGG 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/001,039 



DATE: 02/10/98 
TIME: 1 1:36:57 



INPUT SET: S23358.raw 



ATGGTAGCAG 



3840 



CCATTGCCGT ACTGACAAAG GATGCAGGCA AGCTAACCAT GGGACAGCCA 

CTAGTCATTC 3900 

TGGCCCCCCA TGCAGTAGAG GCACTAGTCA AACAACCCCC CGACCGCTGG 

CTTTCCAACG 3960 

CCCGGATGAC TCACTATCAG GCCTTGCTTT TGGACACGGA CCGGGTCCAG 

TTCGGACCGG 4020 

TGGTAGCCCT GAACCCGGCT ACGCTGCTCC CACTGCCTGA GGAAGGGCTG 

CAACACAACT 4080 

GCCTTGATAT CCTGGCCGAA GCCCACGGAA CCCGACCCGA CCTAACGGAC 

CAGCCGCTCC 4140 

CAGACGCCGA CCACACCTGG TACACGGATG GAAGCAGTCT CTTACAAGAG 

GGACAGCGTA 4200 

AGGCGGGAGC TGCGGTGACC ACCGAGACCG AGGTAATCTG GGCTAAAGCC 

CTGCCAGCCG 4260 

GGACATCCGC TCAGCGGGCT GAACTGATAG CACTCACCCA GGCCCTAAAG 

ATGGCAGAAG 4320 

GTAAGAAGCT AAATGTTTAT ACTGATAGCC GTTATGCTTT TGCTACTGCC 

CATATCCATG 4380 

GAGAAATATA CAGAAGGCGT GGGTTGCTCA CATCAGAAGG CAAAGAGATC 

AAAAATAAAG 4440 

ACGAGATCTT GGCCCTACTA AAAGCCCTCT TTCTGCCCAA AAGACTTAGC 

ATAATCCATT 4500 

GTCCAGGACA TCAAAAGGGA CACAGCGCCG AGGCTAGAGG CAACCGGATG 

GCTGACCAAG 4560 

CGGCCCGAAA GGCAGCCATC ACAGAGACTC CAGACACCTC TACCCTCCTC 

ATAGAAAATT 4620 

CATCACCCTA CACCTCAGAA CATTTTCATT ACACAGTGAC TGATATAAAG 

GACCTAACCA 4680 

AGTTGGGGGC CATTTATGAT AAAACAAAGA AGTATTGGGT CTACCAAGGA 

AAACCTGTGA 4740 

TGCCTGACCA GTTTACTTTT GAATTATTAG ACTTTCTTCA TCAGCTGACT 

CACCTCAGCT 4800 

TCTCAAAAAT GAAGGCTCTC CTAGAGAGAA GCCACAGTCC CTACTACATG 

CTGAACCGGG 4860 
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PAGE: 7 RAW SEQUENCE LISTING DATE: 02/ 1 0/98 

PATENT APPLICATION US/09/001,039 TIME: 1 ! :36:59 

INPUT SET: S23358.raw 

44 0 ATCGAACACT CAAAAATATC ACTGAGACCT GCAAAGCTTG TGCACAAGTC 

— > 441 AACGCCAGCA 4920 

442 

44 3 AGTCTGCCGT TAAACAGGGA ACTAGGGTCC GCGGGCATCG GCCCGGCACT 

— - > 444 C ATTGGGAG A 4980 

445 

44 6 TCGATTTCAC CGAGATAAAG CCCGGATTGT ATGGCTATAA ATATCTTCTA 

— > 447 GTTTTTATAG 5040 

448 

44 9 ATACCTTTTC TGGCTGGATA GAAGCCTTCC CAACCAAGAA AGAAACCGCC 

— > 450 AAGGTCGTAA 5100 

451 

452 CCAAGAAGCT ACTAGAGGAG ATCTTCCCCA GGTTCGGCAT GCCTCAGGTA 

— > 453 TTGGGAACTG 5160 

454 

455 ACAATGGGCC TGCCTTCGTC TCCAAGGTGA GTCAGACAGT GGCCGATCTG 

— > 456 TTGGGGATTG 52 20 L 

457 .., jA 

458 ATTGGAAATT ACATTGTGCA TACAGACCCC AAAGCTCAGG CCAGGTAGAA 

— > 459 AGAATGAATA 5280 

460 

461 GAACCATCAA GGAGACTTTA ACTAAATTAA CGCTTGCAAC TGGCTCTAGA 

— > 462 GACTGGGTGC 5340 

463 

464 TCCTACTCCC CTTAGCCCTG TACCGAGCCC GCAACACGCC GGGCCCCCAT 

— > 465 GGCCTCACCC 5400 

466 

46 7 CATATGAGAT CTTATATGGG GCACCCCCGC CCCTTGTAAA CTTCCCTGAC 

— > 468 CCTGACATGA 5460 

469 

4 70 CAAGAGTTAC TAACAGCCCC TCTCTCCAAG CTCACTTACA GGCTCTCTAC 

— > 471 TTAGTCCAGC 5520 

472 

47 3 ACGAAGTCTG GAGACCTCTG GCGGCAGCCT ACCAAGAACA ACTGGACCGA 

— > 474 CCGGTGGTAC 5580 

475 

476 CTCACCCTTA CCGAGTCGGC GACACAGTGT GGGTCCGCCG ACACCAGACT 

— > 477 AAGAACCTAG 5640 

478 

47 9 AACCTCGCTG GAAAGGACCT TACACAGTCC TGCTGACCAC CCCCACCGCC 

— > 480 CTCAAAGTAG 5700 

481 

482 ACGGCATCGC AGCTTGGATA CACGCCGCCC ACGTGAAGGC TGCCGACCCC 

— > 483 GGGGGTGGAC 5760 

484 

4 85 CATCCTCTAG ACTGACATGG CGCGTTCAAC GCTCTCAAAA CCCCTTAAAA 

— > 486 ATAAGGTTAA 5820 

487 

488 CCCGCGAGGC CCCCTAATCC CCTTAATTCT TCTGATGCTC AGAGGGGTCA 

— > 489 GTACTGCTTC 5880 

490 

4 91 GCCCGGCTCC AGTCCTCATC AAGTCTATAA TATCACCTGG GAGGTAACCA 

— > 492 ATGGAGATCG 5940 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/001,039 



DATE: 02/10/98 
TIME: 1 1:37:02 
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INPUT SET: S23358.raw 



GGAGACGGTA TGGGCAACTT CTGGCAACCA CCCTCTGTGG ACCTGGTGGC 
CTGACCTTAC 6000 

CCCAGATTTA TGTATGTTAG CCCACCATGG ACCATCTTAT TGGGGGCTAG 
AATATCAATC 6060 

CCCTTTTTCT TCTCCCCCGG GGCCCCCTTG TTGCTCAGGG GGCAGCAGCC 
CAGGCTGTTC 6120 

CAGAGACTGC GAAGAACCTT TAACCTCCCT CACCCCTCGG TGCAACACTG 
CCTGGAACAG 6180 

ACTCAAGCTA GACCAGACAA CTCATAAATC AAATGAGGGA TTTTATGTTT 
GCCCCGGGCC 6240 

CCACCGCCCC CGAGAATCCA AGTCATGTGG GGGTCCAGAC TCCTTCTACT 
GTGCCTATTG 6300 

GGGCTGTGAG ACAACCGGTA GAGCTTACTG GAAGCCCTCC TCATCATGGG 
ATTTCATCAC 6360 

AGTAAACAAC AATCTCACCT CTGACCAGGC TGTCCAGGTA TGCAAAGATA 
ATAAGTGGTG 6420 

CAACCCCTTA GTTATTCGGT TTACAGACGC CGGGAGACGG GTTACTTCCT 
GGACCACAGG 6480 

ACATTACTGG GGCTTACGTT TGTATGTCTC CGGACAAGAT CCAGGGCTTA 
CATTTGGGAT 6540 

CCGACTCAGA TACCAAAATC TAGGACCCCG CGTCCCAATA GGGCCAAACC 
CCGTTCTGGC 6600 

AGACCAACAG CCACTCTCCA AGCCCAAACC TGTTAAGTCG CCTTCAGTCA 
CCAAACCACC 6660 

CAGTGGGACT CCTCTCTCCC CTACCCAACT TCCACCGGCG GGAACGGAAA 
ATAGGCTGCT 6720 

AAACTTAGTA GACGGAGCCT ACCAAGCCCT CAACCTCACC AGTCCTGACA 
AAACCCAAGA 6780 

GTGCTGGTTG TGTCTAGTAG CGGGACCCCC CTACTACGAA GGGGTTGCCG 
TCCTGGGTAC 684 0 

CTACTCCAAC CATACCTCTG CTCCAGCCAA CTGCTCCGTG GCCTCCCAAC 
ACAAGTTGAC 6900 

CCTGTCCGAA GTGACCGGAC AGGGACTCTG CATAGGAGCA GTTCCCAAAA 
CACATCAGGC 6960 

CCTATGTAAT ACCACCCAGA CAAGCAGTCG AGGGTCCTAT TATCTAGTTG 



/ 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/001 ,039 



DATE: 02/10/98 
TIME: 11:37:04 
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INPUT SET: S23358.raw 



CCCCTACAGG 



7020 



TACCATGTGG GCTTGTAGTA CCGGGCTTAC TCCATGCATC TCCACCACCA 
TACTGAACCT 7080 

TACCACTGAT TATTGTGTTC TTGTCGAACT CTGGCCAAGA GTCACCTATC 
ATTCCCCCAG 7140 

CTATGTTTAC GGCCTGTTTG AGAGATCCAA CCGACACAAA AGAGAACCGG 
TGTCGTTAAC 7200 

CCTGGCCCTA TTATTGGGTG GACTAACCAT GGGGGGAATT GCCGCTGGAA 
TAGGAACAGG 72 60 

GACTACTGCT CTAATGGCCA CTCAGCAATT CCAGCAGCTC CAAGCCGCAG 
TACAGGATGA 7320 

TCTCAGGGAG GTTGAAAAAT CAATCTCTAA CCTAGAAAAG TCTCTCACTT 
CCCTGTCTGA 7380 

AGTTGTCCTA CAGAATCGAA GGGGCCTAGA CTTGTTATTT CTAAAAGAAG 
GAGGGCTGTG 7440 

TGCTGCTCTA AAAGAAGAAT GTTGCTTCTA TGCGGACCAC ACAGGACTAG 
TGAGAGACAG 7500 

CATGGCCAAA TTGAGAGAGA GGCTTAATCA GAGACAGAAA CTGTTTGAGT 
CAACTCAAGG 7560 

ATGGTTTGAG GGACTGTTTA ACAGATCCCC TTGGTTTACC ACCTTGATAT 
CTACCATTAT 7620 

GGGACCCCTC ATTGTACTCC TAATGATTTT GCTCTTCGGA CCCTGCATTC 
TTAATCGATT 7680 

AGTCCAATTT GTTAAAGACA GGATATCAGT GGTCCAGGCT CTAGTTTTGA 
CTCAACAATA 7740 

TCACCAGCTG AAGCCTATAG AGTACGAGCC ATAGATAAAA TAAAAGATTT 
TATTTAGTCT 7800 

CCAGAAAAAG GGGGGAATGA AAGACCCCAC CTGTAGGTTT GGCAAGCTAG 
CTTAAGTAAC 7860 

GCCATTTTGC AAGGCATGGA AAAATACATA ACTGAGAATA GAGAAGTTCA 
GATCAAGGTC 7920 

AGGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA 
GCAGTTCCTG 7980 

CCCCGGCTCA GGGCCAAGAA CAGATGGAAC AGCTGAATAT GGGCCAAACA 
GGATATCTGT 8040 
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599 
600 
601 
602 
603 
604 
605 
606 
607 
608 
609 
610 
611 
612 
613 



GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC AAGAACAGAT GGTCCCCAGA 
TGCGGTCCAG 8100 

CCCTCAGCAG TTTCTAGAGA ACCATCAGAT GTTTCCAGGG TGCCCCAAGG 
ACCTGAAATG 8160 

ACCCTGTGCC TTATTTGAAC TAACCAATCA GTTCGCTTCT CGCTTCTGTT 
CGCGCGCTTC 82 20 

TGCTCCCCGA GCTCAATAAA AGAGCCCACA ACCCCTCACT CGGGGCGCCA 
GTCCTCCGAT 8280 

TGACTGAGTC GCCCGGGTAC CCGTGTATCC AATAAACCCT CTTGCAGTTG CA 
8332 



/ 



> 



714 
715 
716 

717 
718 
719 
720 
721 
722 
723 
724 
725 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TCGAGGGGCC CAGATCTGCG GCCGCTCGCG AGTCGACAAG CTTGGATCCA T 
51 



726 (2) INFORMATION FOR SEQ ID NO: 27: 

727 (i) SEQUENCE CHARACTERISTICS: 
— > 728 (A) LENGTH: 49 base pairs 

729 (B) TYPE: nucleic acid 

730 (C) STRANDEDNESS: single 

731 (D) TOPOLOGY: linear 

732 (ii) MOLECULE TYPE: DNA (genomic) 

, 733 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

7 34 CGATGGATCC AAGCTTGTCG ACTCGCGAGC GGCCGCAGAT CTGGGCCCC 

735 49 
736 



899 (2) INFORMATION FOR SEQ ID NO: 44: 

900 (i) SEQUENCE CHARACTERISTICS: 

— > 901 (A) LENGTH: 9080 base pairs 

902 (B) TYPE: nucleic acid 

903 (C) STRANDEDNESS: single 

904 (D) TOPOLOGY: linear 

905 (ii) MOLECULE TYPE: DNA (genomic) 

906 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

907 GGGGGGGGGG GGGGGGGGGG GGGTGAGCAC ATCCAGTGGG TAAAGTTCCT 

— > 908 TAAAATGCTC 60 

90 9 TGCAAAGAAA TTGGGACTTT TCATTAAATC AGAAATTTTA CTTTTTTCCC 

— > 910 CTCCTGGGAG 120 
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911 


CTAAAGATAT 


TTTAGAGAAG 


AATTAACCTT 


TTGCTTCTCC 


AGTTGAACAT 


— > 


912 


TTGTAGCAAT 


180 










913 


AAGTCATGCA 


AATAGAGCTC 


TCCACCTGCT 


TCTTTCTGTG 


CCTTTTGCGA 


— > 


914 


TTCTGCTTTA 


240 










915 


GTGCCACCAG 


AAGATACTAC 


CTGGGTGCAG 


TGGAACTGTC 


ATGGGACTAT 


— > 


916 


ATGCAAAGTG 


300 










917 


ATCTCGGTGA 


GCTGCCTGTG 


GACGCAAGAT 


TTCCTCCTAG 


AGTGCCAAAA 


— > 


918 


TCTTTTCCAT 


360 










919 


TCAACACCTC 


AGTCGTGTAC 


AAAAAGACTC 


TGTTTGTAGA 


ATTCACGGAT 


— > 


920 


CACCTTTTCA 


420 










921 


ACATCGCTAA 


GCCAAGGCCA 


CCCTGGATGG 


GTCTGCTAGG 


TCCTACCATC 


— > 


922 


CAGGCTGAGG 


480 










923 


TTTATGATAC 


AGTGGTCATT 


ACACTTAAGA 


ACATGGCTTC 


CCATCCTGTC 


— > 


924 


AGTCTTCATG 


540 










925 


CTGTTGGTGT 


ATCCTACTGG 


AAAGCTTCTG 


AGGGAGCTGA 


ATATGATGAT 


— > 


926 


CAGACCAGTC 


600 










927 


AAAGGGAGAA 


AGAAGATGAT 


AAAGTCTTCC 


CTGGTGGAAG 


CCATACATAT 


— > 


928 


GTCTGGCAGG 


660 










929 


TCCTGAAAGA 


GAATGGTCCA 


ATGGCCTCTG 


ACCCACTGTG 


CCTTACCTAC 


— > 


930 


TCATATCTTT 


720 










931 


CTCATGTGGA 


CCTGGTAAAA 


GACTTGAATT 


CAGGCCTCAT 


TGGAGCCCTA 


— > 


932 


CTAGTATGTA 


780 










933 


GAGAAGGGAG 


TCTGGCCAAG 


GAAAAGACAC 


AGACCTTGCA 


CAAATTTATA 


— > 


934 


CTACTTTTTG 


840 










935 


CTGTATTTGA 


TGAAGGGAAA 


AGTTGGCACT 


CAGAAACAAA 


GAACTCCTTG 


— > 


936 


ATGCAGGATA 


900 










937 


GGGATGCTGC 


ATCTGCTCGG 


GCCTGGCCTA 


AAATGCACAC 


AGTCAATGGT 


— > 


938 


TATGTAAACA 


960 










939 


GGTCTCTGCC 


AGGTCTGATT 


GGATGCCACA 


GGAAATCAGT 


CTATTGGCAT 


— > 


940 


GTGATTGGAA 


1020 










941 


TGGGCACCAC 


TCCTGAAGTG 


CACTCAATAT 


TCCTCGAAGG 


TCACACATTT 


— > 


942 


CTTGTGAGGA 


1080 










943 


ACCATCGCCA 


GGCGTCCTTG 


GAAATCTCGC 


CAATAACTTT 


CCTTACTGCT 


— > 


944 


CAAACACTCT 


1140 










945 


TGATGGACCT 


TGGACAGTTT 


CTACTGTTTT 


GTCATATCTC 


TTCCCACCAA 


— > 


946 


CATGATGGCA 


1200 










947 


TGGAAGCTTA 


TGTCAAAGTA 


GACAGCTGTC 


CAGAGGAACC 


CCAACTACGA 


— > 


948 


ATGAAAAATA 


1260 










949 


ATGAAGAAGC 


GGAAGACTAT 


GATGATGATC 


TTACTGATTC 


TGAAATGGAT 


— > 


950 


GTGGTCAGGT 


1320 










951 


TTGATGATGA 


CAACTCTCCT 


TCCTTTATCC 


AAATTCGCTC 


AGTTGCCAAG 


— > 


952 


AAGCATCCTA 


1380 










953 


AAACTTGGGT 


ACATTACATT 


GCTGCTGAAG 


AGGAGGACTG 


GGACTATGCT 


— > 


954 


CCCTTAGTCC 


1440 










955 


TCGCCCCCGA 


TGACAGAAGT 


TATAAAAGTC 


AATATTTGAA 


CAATGGCCCT 


— > 


956 


CAGCGGATTG 


1500 










957 


GTAGGAAGTA 


CAAAAAAGTC 


CGATTTATGG 


CATACACAGA 


TGAAACCTTT 


— > 


958 


AAGACTCGTG 


1560 










959 


AAGCTATTCA 


GCATGAATCA 


GGAATCTTGG 


GACCTTTACT 


TTATGGGGAA 


— > 


960 


GTTGGAGACA 


1620 










961 


CACTGTTGAT 


TATATTTAAG 


AATCAAGCAA 


GCAGACCATA 


TAACATCTAC 


— > 


962 


CCTCACGGAA 


1680 










963 


TCACTGATGT 


CCGTCCTTTG 


TATTCAAGGA 


GATTACCAAA 


AGGTGTAAAA 



/ i 



it) '- < 



# 




# 
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— > 


964 


CATTTGAAGG 


1740 










965 


ATTTTCCAAT 


TCTGCCAGGA 


GAAATATTCA 


AATATAAATG 


GACAGTGACT 


— > 


966 


GTAGAAGATG 


1800 










967 


GGCCAACTAA 


ATCAGATCCT 


CGGTGCCTGA 


CCCGCTATTA 


CTCTAGTTTC 


— > 


968 


GTTAATATGG 


1860 










969 


AGAGAGATCT 


AGCTTCAGGA 


CTCATTGGCC 


CTCTCCTCAT 


CTGCTACAAA 


— > 


970 


GAATCTGTAG 


1920 










971 


ATCAAAGAGG 


AAACCAGATA 


ATGTCAGACA 


AGAGGAATGT 


CATCCTGTTT 


— > 


972 

**¥ 9 mm 


TCTGTATTTG 

A * * at ■ A A A ***W 


1980 










97 3 


ATGAGAACCG 

A i X **mt* A* Vrf* A i A A 


AAGCTGGTAC 


CTCACAGAGA 


ATATACAACG 


CTTTCTCCCC 


— > 


974 


AATCCAGCTG 


2040 










97 5 


GAGTGCAGCT 


TGAGGATCCA 


GAGTTCCAAG 


CCTCCAACAT 


CATGCACAGC 


— > 


976 


ATCAATGGCT 

■ ■ 4 ***+9TmAT*\ * * 


2100 










97 7 

*S 9 1 


ATGTTTTTGA 

Xi A U X X X X X VJ Xi 


TAGTTTGCAG 


TTGTCAGTTT 

X X V#/ X Jfc XXX 


GTTTGCATGA 


GGTGGCATAC 

***** ***** X *mmt* ***** ***** A i X X i ***** 


> 


978 


TGGTACATTC 


2160 

<w -A> w V/ 










97 9 


TAAGCATTGG 

X rin VJ V_* J. X VJ VJ 


AGCACAGACT 


GAPTTPPTTT 


PTGTPTTCTT 

V^ X VJ X V** x X v^ X X 


CTCTGGATAT 

*w X X VJ VJ XX X Xi X 


> 


980 


ACCTTCAAAC 

riww X X V^ am ■ ■ *r» V^ 


2220 










981 

J KJ X, 


ACAAAATGGT 

n\^nn/in J. vj vj x 


CTATGAAGAC 


ACACTCACCC 

Jfc ii V^ X V-h j-i w v^ 


TATTCCCATT 

X 4*. X X *w XX X X 


CTCAGGAGAA 


> 


982 

Iw O At 


ACTGTCTTCA 

■ ft X A A X aft 


2280 

Xi mm \J \J 










y u J 


TGTCGATGGA 

X VJ X V-* VJ xx J. VJ VJ f\ 


AAACCCAGGT 


CTATGGATTC 

V** X ii X VJ VJ £"\ X X V-' 


TGGGGTGCCA 

X VJ VJ VJ VJ X VJ V-c V-c ii. 


CAACTCAGAC 

*w XiXi X V AV AV 


> 


984 


TTTPRnAAPA 


2340 










985 

J VJ *J 


GAGGCATGAC 

\Jri\J vj v^- J. wnVrf 


CGCCTTACTG 


AAGGTTTCTA 

at\atwj vj xxx V— ' x n 


GTTGTGACAA 

VJ X X VJ X VJ n V^< AA 


GAACACTGGT 

wf*fiwrvw x vj vj x 




986 

7 O U 


GATTATTACG 


2400 










987 


AGGACAGTTA 

•TVVJ VJ *T.VJ X X T\ 


TGAAGATATT 


TCAGCATACT 


TGCTGAGTAA 


AAACAATGCC 

AAAVAA X WV V 


> 


988 

7 O O 


ATTGAACCAA 

*r\ x x \inn^^nn 


2460 










98 9 


GAAGCTTCTC 


CCAGAATTCA 


AGACACCCTA 


GCACTAGGCA 

VJ w f\Vf X xx VJ VJ V^ A 


AAAGCAATTT 


> 


990 

7 7 \l 


AATGCCACCA 


2520 










9 9 1 

Zf s A. 


CA ATTCCAGA 


AAATGACATA 


GAGAAGACTG 


ACCCTTGGTT 

niV^v_>v^ x x vjvj x x 


TGCACACAGA 

X \JVy AVyAVvnWA 


> 


99 2 

7 7 Ai 


ACACCTATGC 


2580 

mmt m3 U \i 










9 9 1 
y y -j 


PTAAA ATACA 


A AATGTrTrr 


TCT A GTG ATT 

x v> x .nvJ x n x x 


TGTTG ATGPT 

x vj x x un x \j\_> x 


CTTGCGACAG 

v_^ x x vj Vv wriv^r\vj 


> 


994 

7 7 


AGTCCTACTC 


2640 










995 

j s -j 


CACATGGGCT 


ATCCTTATCT 


GATCTCCAAG 

VJ/1 X V— ' X Vw* \w* ^X^L VJ 


AAGCCAAATA 

AAVJ V^- V^- AAA X *»> 


TGAGACTTTT 

x \jnvjr\vy x x x x 


> 


996 

7 7 (J 


TCTGATGATC 


2700 










997 


CATCACCTGG 


AGCAATAGAC 


AGTAATAACA 


GCCTGTCTGA 

VJ V^ V_* X VJ X X VJ A*. 


AATGACACAC 

Xi xi x w Avnvnw 


> 


998 

7 7 W 


TTCAGGCCAC 

x x v# \j n 


2760 










999 


AGCTCCATCA 


CAGTGGGGAC 


ATGGTATTTA 

ii X \^ W X AA. XXX «L 


CCCCTGAGTC 

V^ V*^ V^ V-' X VJ *» X 


AGGCCTCCAA 

i i *^mff ***** ***** ****** X ***** ***-+ A i -i i 


— > 


1000 


TTAAGATTAA 


2820 

mm W X* V 










1001 

x, V v ***** 


ATGAGAAACT 


GGGGACAACT 


GCAGCAACAG 


AGTTGAAGAA 

J i ^— ^ X X X i i i A A A A 


ACTTGATTTC 

XX V*^ X X ***** A i XXX ^— ' 


— > 


1002 

Xi W w mm 


A A A GTTTC T A 


2880 










100 3 


GTACATCAAA 


TAATCTGATT 

X X ' X X X 


TCAACAATTC 

X **m-r A A. A L \^ A A A A X X 


CATCAGACAA 

ft X ***** A A **M-r A A. ^m-rT A i * i 


TTTGGCAGCA 


— > 


1004 


GGTACTGATA 


2940 










1005 


ATACAAGTTC 


CTTAGGACCC 


CCAAGTATGC 


CAGTTCATTA 


TGATAGTCAA 


— — "> 


100 £ 


TTACATAfTA 












1007 


CTCTATTTGG 


CAAAAAGTCA 


TCTCCCCTTA 


CTGAGTCTGG 


TGGACCTCTG 


— > 


1008 


AGCTTGAGTG 


3060 










1009 


AAGAAAATAA 


TGATTCAAAG 


TTGTTAGAAT 


CAGGTTTAAT 


GAATAGCCAA 


— > 


1010 


GAAAGTTCAT 


3120 










1011 


GGGGAAAAAA 


TGTATCGTCA 


ACAGAGAGTG 


GTAGGTTATT 


TAAAGGGAAA 


— > 


1012 


AGAGCTCATG 


3180 










1013 


GACCTGCTTT 


GTTGACTAAA 


GATAATGCCT 


TATTCAAAGT 


TAGCATCTCT 


— > 


1014 


TTGTTAAAGA 


3240 










1015 


CAAACAAAAC 


TTCCAATAAT 


TCAGCAACTA 


ATAGAAAGAC 


TCACATTGAT 


— > 


1016 


GGCCCATCAT 


3300 











PAGE: 13 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/001,039 



DATE: 02/10/98 
TIME: 1 1:37:15 



INPUT SET: S23358.raw 





1017 


TATTAATTGA 


GAATAGTCCA 


TCAGTCTGGC 


AAAATATATT 


AGAAAGTGAC 


— > 


1018 


ACTGAGTTTA 


3360 










1019 


AAAAAGTGAC 


ACCTTTGATT 


CATGACAGAA 


TGCTTATGGA 


CAAAAATGCT 


— > 


1020 


ACAGCTTTGA 


3420 










1021 


GGCTAAATCA 


TATGTCAAAT 


AAAACTACTT 


CATCAAAAAA 


CATGGAAATG 


— > 


1022 


GTCCAACAGA 


3480 










1023 


AAAAAGAGGG 


CCCCATTCCA 


CCAGATGCAC 


AAAATCCAGA 


TATGTCGTTC 


— > 


1024 


TTTAAGATGC 


3540 










1025 


TATTCTTGCC 


AGAATCAGCA 


AGGTGGATAC 


AAAGGACTCA 


TGGAAAGAAC 


— > 


1026 


TCTCTGAACT 


3600 










1027 


CTGGGCAAGG 


CCCCAGTCCA 


AAGCAATTAG 


TATCCTTAGG 


ACCAGAAAAA 


— > 


1028 


TCTGTGGAAG 


3660 










1029 


GTCAGAATTT 

X ' A A * * A * X X X 


CTTGTCTGAG 


AAAAACAAAG 


TGGTAGTAGG 


AAAGGGTGAA 


— > 


1030 


TTTACAAAGG 


3720 










1031 


ACGTAGGACT 


CAAAGAGATG 


GTTTTTCCAA 


GCAGCAGAAA 

W -4 * X* V^X^X*XX 


CCTATTTCTT 

^— ^ X ATX XXX ' X X 


— > 


1032 


ACTAACTTGG 


3780 










1033 


ATAATTTACA 


TGAAAATAAT 


ACACACAATC 


AAGAAAAAAA 


AATTCAGGAA 


— > 


1034 


GAAATAGAAA 


3840 










1035 


AGAAGGAAAC 

at*** *»,*»*x ^— 


ATTAATCCAA 

4 A X X * A A A X N/ A A A A 


GAGAATGTAG 


TTTTGCCTCA 


GATACATACA 

AT* X AT* X A L AT* 


— > 


1036 

A \J —J V 


GTGACTGGCA 

X* A X**F»Xrf A XVXWtTS 


3900 










1037 


CTAAGAATTT 

^ X f A A A. X— ' A A A A X X X 


CATGAAGAAC 

XX X X^ * A^A a & X^* A A 


CTTTTCTTAC 

X X X X X X *fc 


TGAGCACTAG 


GCAAAATGTA 

AA & A. A^ A A A X X A A 


— > 


1038 


GAAGGTTCAT 


3960 










103 9 


ATGACGGGGC 


ATATGCTCCA 


GTACTTCAAG 


ATTTTAGGTC 

**■ x x x x nwv x 


ATTAAATGAT 

A A X X A A A A A A> X wfl X 


— > 


1040 


TCAACAAATA 


4020 










1041 

X Vs X- 


GAACAAAGAA 


ACACACAGCT 


CATTTCTCAA 


AAAAAGGGGA 

Arm. A^m A^m.n A^X U NJ U V^V 


GGAAGAAAAC 


— > 


1042 


TTGGAAGGCT 


4080 










104 3 


TGGGAAATCA 


AACCAAGCAA 

a a aa v** x^* nnv a & 


ATTGTAGAGA 


AATATGCATG 

m ± X *i x w X 


CACCACAAGG 

^—^ A A A A A A A A W V^* 


— > 


1044 


ATATCTCCTA 


4140 










104 5 


ATACAAGCCA 


GCAGAATTTT 


GTCACGCAAC 


GTAGTAAGAG 


AGCTTTGAAA 


— > 


1046 


CAATTCAGAC 


4200 










1047 


TCCCACTAGA 


AGAAACAGAA 


CTTGAAAAAA 


GGATAATTGT 


GGATGACACC 

Vrf* ATX X ATX ATX 


— > 


1048 

•X. V ^» \J 


TCAACCCAGT 

A X^ ■ • mm X^ X^ X^ ■ ■ XP A 


4260 

w W W 










104 9 


GGTCCAAAAA 

X^ Xp* X V-' « A, A A* A* AA A 


CATGAAACAT 


TTGACCCCGA 


GCACCCTCAC 


ACAGATAGAC 

Alt « * A A. X A A A A, 


— > 


1050 


TACAATGAGA 


4320 










1051 


AGGAGAAAGG 

A A X^ A & A * A A A & X»* 


GGCCATTACT 


CAGTCTCCCT 


TATCAGATTG 

x a * x n «x x x 


CCTTACGAGG 


— > 


1052 


AGTCATAGCA 


4380 










1053 


TCCCTCAAGC 


AAATAGATCT 


CCATTACCCA 


TTGCAAAGGT 


ATCATCATTT 


— > 


1054 


CCATCTATTA 


4440 










1055 


GACCTATATA 


TCTGACCAGG 


GTCCTATTCC 


AAGACAACTC 


TTCTCATCTT 


— > 


1056 


CCAGCAGCAT 


4500 










1057 


CTTATAGAAA 


GAAAGATTCT 


GGGGTCCAAG 


AAAGCAGTCA 


TTTCTTACAA 


— > 


1058 


GGAGCCAAAA 


4560 










105 9 

X v j y 


AAAATAAGCT 


X X ± X -L rt. \_5 - V— 


ATTCTAACCT 


TGGAGATGAC 


TGGTGATGAA 


— > 


1060 


AGAGAGGTTG 


4620 










1061 


GCTCCCTGGG 


GACAAGTGCC 


ACAAATTCAG 


TCACATACAA 


GAAAGTTGAG 


— > 


1062 


AACACTGTTC 


4680 










1063 


TCCCGAAACC 


AGACTTGCCC 


AAAACATCTG 


GCAAAGTTGA 


ATTGCTTCCA 


— > 


1064 


AAAGTTCACA 


4740 










1065 


TTTATCAGAA 


GGACCTATTC 


CCTACGGAAA 


CTAGCAATGG 


GTCTCCTGGC 


— > 


1066 


CATCTGGATC 


4800 










1067 


TCGTGGAAGG 


GAGCCTTCTT 


CAGGGAACAG 


AGGGAGCGAT 


TAAGTGGAAT 


— > 


1068 


GAAGCAAACA 


4860 










1069 


GACCTGGAAA 


AGTTCCCTTT 


CTGAGAGTAG 


CAACAGAAAG 


CTCTGCAAAG 
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— > 


1070 


ACTCCCTCCA 


4920 










1071 


AGCTATTGGA 


TCCTCTTGCT 


TGGGATAACC 


ACTATGGTAC 


TCAGATACCA 


— > 


1072 


AAAGAAGAGT 


4980 










1073 


GGAAATCCCA 


AGAGAAGTCA 


CCAGAAAAAA 


CAGCTTTTAA 


GAAAAAGGAT 


— > 


1074 


ACCATTTTGT 


5040 










1075 


CCCTGAACGC 


TTGTGAAAGC 


AATCATGCAA 


TAGCAGCAAT 


AAATGAGGGA 


— > 


1076 


CAAAATAAGC 


5100 










1077 


CCGAAATAGA 


AGTCACCTGG 


GCAAAGCAAG 


GTAGGACTGA 


AAGGCTGTGC 


— > 


1078 


TCTCAAAACC 


5160 










1079 


CACCAGTCTT 


GAAACGCCAT 


CAACGGGAAA 


TAACTCGTAC 


TACTCTTCAG 


— > 


1080 


TCAGATCAAG 


5220 










1081 


AGGAAATTGA 


CTATGATGAT 


ACCATATCAG 


TTGAAATGAA 


GAAGGAAGAT 


— > 


1082 


TTTGACATTT 


5280 










1083 


ATGATGAGGA 


TGAAAATCAG 


AGCCCCCGCA 


GCTTTCAAAA 


GAAAACACGA 


— > 


1084 


CACTATTTTA 


5340 










1085 


TTGCTGCAGT 


GGAGAGGCTC 


TGGGATTATG 


GGATGAGTAG 


CTCCCCACAT 


— > 


1086 


GTTCTAAGAA 


5400 










1087 


ACAGGGCTCA 


GAGTGGCAGT 


GTCCCTCAGT 


TCAAGAAAGT 


TGTTTTCCAG 


— > 


1088 


GAATTTACTG 


5460 










1089 


ATGGCTCCTT 


TACTCAGCCC 


TTATACCGTG 


GAGAACTAAA 


TGAACATTTG 


— > 


1090 


GGACTCCTGG 


5520 










1091 


GGCCATATAT 


AAGAGCAGAA 


GTTGAAGATA 


ATATCATGGT 


AACTTTCAGA 


— > 


1092 


AATCAGGCCT 


5580 










1093 


CTCGTCCCTA 


TTCCTTCTAT 


TCTAGCCTTA 


TTTCTTATGA 


GGAAGATCAG 


— > 


1094 


AGGCAAGGAG 


5640 










1095 


CAGAACCTAG 


AAAAAACTTT 


GTCAAGCCTA 


ATGAAACCAA 


AACTTACTTT 


— > 


1096 


TGGAAAGTGC 


5700 










1097 


AACATCATAT 


GGCACCCACT 


AAAGATGAGT 


TTGACTGCAA 


AGCCTGGGCT 


— > 


1098 


TATTTCTCTG 


5760 










1099 


ATGTTGACCT 


GGAAAAAGAT 


GTGCACTCAG 


GCCTGATTGG 


ACCCCTTCTG 


— > 


1100 


GTCTGCCACA 


5820 










1101 


CTAACACACT 


GAACCCTGCT 


CATGGGAGAC 


AAGTGACAGT 


ACAGGAATTT 


— > 


1102 


GCTCTGTTTT 


5880 










1103 


TCACCATCTT 


TGATGAGACC 


AAAAGCTGGT 


ACTTCACTGA 


AAATATGGAA 


— > 


1104 


AGAAACTGCA 


5940 










1105 


GGGCTCCCTG 


CAATATCCAG 


ATGGAAGATC 


CCACTTTTAA 


AGAGAATTAT 


— > 


1106 


CGCTTCCATG 


6000 










1107 


CAATCAATGG 


CTACATAATG 


GATACACTAC 


CTGGCTTAGT 


AATGGCTCAG 


— > 


1108 


GATCAAAGGA 


6060 










1109 


TTCGATGGTA 


TCTGCTCAGC 


ATGGGCAGCA 


ATGAAAACAT 


CCATTCTATT 


— > 


1110 


CATTTCAGTG 


6120 










1111 


GACATGTGTT 


CACTGTACGA 


AAAAAAGAGG 


AGTATAAAAT 


GGCACTGTAC 


— > 


1112 


AATCTCTATC 


6180 










1113 


CAGGTGTTTT 


TGAGACAGTG 


GAAATGTTAC 


CATCCAAAGC 


TGGAATTTGG 


— > 


1114 


CGGGTGGAAT 


6240 










1115 


GCCTTATTGG 


CGAGCATCTA 


CATGCTGGGA 


TGAGCACACT 


TTTTCTGGTG 


— > 


1116 


TACAGCAATA 


6300 










1117 


AGTGTCAGAC 


TCCCCTGGGA 


ATGGCTTCTG 


GACACATTAG 


AGATTTTCAG 


— > 


1118 


ATTACAGCTT 


6360 










1119 


CAGGACAATA 


TGGACAGTGG 


GCCCCAAAGC 


TGGCCAGACT 


TCATTATTCC 


— > 


1120 


GGATCAATCA 


6420 










1121 


ATGCCTGGAG 


CACCAAGGAG 


CCCTTTTCTT 


GGATCAAGGT 


GGATCTGTTG 


— > 


1122 


GCACCAATGA 


6480 











• 
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1123 


TTATTCACGG 


CATCAAGACC 


CAGGGTGCCC 


GTCAGAAGTT 


CTCCAGCCTC 


— > 


1124 


TACATCTCTC 


6540 










1125 


AGTTTATCAT 


CATGTATAGT 


CTTGATGGGA 


AGAAGTGGCA 


GACTTATCGA 


— > 


1126 


GGAAATTCCA 


6600 










1127 


CTGGAACCTT 


AATGGTCTTC 


TTTGGCAATG 


TGGATTCATC 


TGGGATAAAA 


— > 


1128 


CACAATATTT 


6660 










1129 


TTAACCCTCC 


AATTATTGCT 


CGATACATCC 


GTTTGCACCC 


AACTCATTAT 


— > 


1130 


AGCATTCGCA 


6720 










1131 


GCACTCTTCG 


CATGGAGTTG 


ATGGGCTGTG 


ATTTAAATAG 


TTGCAGCATG 


— > 


1132 


CCATTGGGAA 


6780 










1133 


TGGAGAGTAA 


AGCAATATCA 


GATGCACAGA 


TTACTGCTTC 


ATCCTACTTT 


— > 


1134 


ACCAATATGT 


6840 










1135 


TTGCCACCTG 


GTCTCCTTCA 


AAAGCTCGAC 


TTCACCTCCA 


AGGGAGGAGT 


— > 


1136 


AATGCCTGGA 


6900 










1137 


GACCTCAGGT 


GAATAATCCA 


AAAGAGTGGC 


TGCAAGTGGA 


CTTCCAGAAG 


— > 


1138 


ACAATGAAAG 


6960 










1139 


TCACAGGAGT 


AACTACTCAG 


GGAGTAAAAT 


CTCTGCTTAC 


CAGCATGTAT 


— > 


1140 


GTGAAGGAGT 


7020 










1141 


TCCTCATCTC 


CAGCAGTCAA 


GATGGCCATC 


AGTGGACTCT 


CTTTTTTCAG 


— > 


1142 


AATGGCAAAG 


7080 










1143 


TAAAGGTTTT 


TCAGGGAAAT 


CAAGACTCCT 


TCACACCTGT 


GGTGAACTCT 


— > 


1144 


CTAGACCCAC 


7140 










ai45 


CGTTACTGAC 


TCGCTACCTT 


CGAATTCACC 


CCCAGAGTTG 


GGTGCACCAG 


— > 


1146 


ATTGCCCTGA 


7200 










1147 


GGATGGAGGT 


TCTGGGCTGC 


GAGGCACAGG 


ACCTCTACTG 


AGGGTGGCCA 


— > 


1148 


CTGCAGCACC 


7260 










1149 


TGCCACTGCC 


GTCACCTCTC 


CCTCCTCAGC 


TCCAGGGCAG 


TGTCCCTCCC 


— > 


1150 


TGGCTTGCCT 


7320 










1151 


TCTACCTTTG 


TGCTAAATCC 


TAGCAGACAC 


TGCCTTGAAG 


CCTCCTGAAT 


— > 


1152 


TAACTATCAT 


7380 










a is 3 


CAGTCCTGCA 


TTTCTTTGGT 


GGGGGGCCAG 


GAGGGTGCAT 


CCAATTTAAC 


— > 


1154 


TTAACTCTTA 


7440 










115 5 


CCTATTTTCT 


GCAGCTGCTC 


CCAGATTACT 


CCTTCCTTCC 


AATATAACTA 


— > 


1156 


GGCAAAAAGA 


7500 










115 7 


AGTGAGGAGA 


AACCTGCATG 


AAAGCATTCT 


TCCCTGAAAA 


GTTAGGCCTC 


— > 


1158 


TCAGAGTCAC 


7560 










1159 


CACTTCCTCT 


GTTGTAGAAA 


AACTATGTGA 


TGAAACTTTG 


AAAAAGATAT 


— > 


1160 


TTATGATGTT 


7620 










1161 


AACATTTCAG 


GTTAAGCCTC 


ATACGTTTAA 


AATAAAACTC 


TCAGTTGTTT 


— > 


1162 


ATTATCCTGA 


7680 










1163 


TCAAGCATGG 


AACAAAGCAT 


GTTTCAGGAT 


CAGATCAATA 


CAATCTTGGA 


— > 


1164 


GTCAAAAGGC 


7740 










1165 


AAATCATTTG 


GACAATCTGC 


AAAATGGAGA 


GAATACAATA 


ACTACTACAG 


— > 


1166 


TAAAGTCTGT 


7800 










1167 


TTCTGCTTCC 


TTACACATAG 


ATATAATTAT 


GTTATTTAGT 


CATTATGAGG 


— > 


1168 


GGCACATTCT 


7860 










1169 


TATCTCCAAA 


ACTAGCATTC 


TTAAACTGAG 


AATTATAGAT 


GGGGTTCAAG 


— > 


1170 


AATCCCTAAG 


7920 










1171 


TCCCCTGAAA 


TTATATAAGG 


CATTCTGTAT 


AAATGCAAAT 


GTGCATTTTT 


— > 


1172 


CTGACGAGTG 


7980 










1173 


TCCATAGATA 


TAAAGCCATT 


TGGTCTTAAT 


TCTGACCAAT 


AAAAAAATAA 


— > 


1174 


GTCAGGAGGA 


8040 










1175 


TGCAATTGTT 


GAAAGCTTTG 


AAATAAAATA 


ACAATGTCTT 


CTTGAAATTT 
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— 


-> 


1176 


CTCATGGCCA 


8100 












1177 


AGAAAGAAAA 


TGATGATGAC 


ATTAGGCTTC 


TAAAGGACAT 


ACATTTAATA 


— 


-> 


1178 


TTTCTGTGGA 


8160 












1179 


AATATGAGGA 


AAATCCATGG 


TTATCTGAGA 


TAGGAGATAC 


AAACTTTGTA 


— 


-> 


1180 


ATTCTAATAA 


8220 












1181 


TGCACTCAGT 


TTACTCTCTC 


CCTCTACTAA 


TTTCCTGCTG 


AAAATAACAC 


— 


-> 


1182 


AACAAAAATG 


8280 












1183 


TAACAGGGGA 


AATTATATAC 


CGTGACTGAA 


AACTAGAGTC 


CTACTTACAT 


_ 


-> 


1184 


AGTTGAAATA 


8340 












1185 


TCAAGGAGGT 


CAGAAGAAAA 


TTGGACTGGT 


GAAAACAGAA 


AAAACACTCC 





-> 


1186 


AGTCTGCCAT 


8400 












1187 


ATCACCACAC 


AATAGGATCC 


CCCTTCTTGC 


CCTCCACCCC 


CATAAGATTG 





-> 


1188 


TGAAGGGTTT 


8460 












* 1189 


ACTGCTCCTT 


CCATCTGCCT 


GACCCCTTCA 


CTATGACTAC 


ACAGAATCTC 





-> 


1190 


CTGATAGTAA 


8520 












1191 


AGGGGGCTGG 


AGGCAAGGAT 


AAGTTATAGA 


GCAGTTGGAG 


GAAGCATCCA 


_ 


-> 


1192 


AAGATTGCAA 


8580 












1193 


CCCAGGGCAA 


ATGGAAAACA 


GGAGATCCTA 


ATATGAAAGA 


AAAATGGATC 


_ 


-> 


1194 


CCAATCTGAG 


8640 












1195 


AAAAGGCAAA 


AGAATGGCTA 


CTTTTTTCTA 


TGCTGGAGTA 


TTTTCTAATA 




-> 


1196 


ATCCTGCTTG 


8700 












1197 


ACCCTTATCT 


GACCTCTTTG 


GAAACTATAA 


CATAGCTGTC 


ACAGTATAGT 


— 


-> 


1198 


CACAATCCAC 


8760 












1199 


AAATGATGCA 


GGTGCAAATG 


GTTTATAGCC 


CTGTGAAGTT 


CTTAAAGTTT 




-> 


1200 


AGAGGCTAAC 


8820 












1201 


TTACAGAAAT 


GAATAAGTTG 


TTTTGTTTTA 


TAGCCCGGTA 


GAGGAGTTAA 




-> 


1202 


CCCCAAAGGT 


8880 












1203 


GATATGGTTT 


TATTTCCTGT 


TATGTTTAAC 


TTAATAATCT 


TATTTTGGCA 




-> 


1204 


TTCTTTTCCC 


8940 












1205 


ATTGACTATA 


TACATCTCTA 


TTTCTCAAAT 


GTTCATGGAA 


CTAGCTCTTT 




-> 


1206 


TATTTTCCTG 


9000 












1207 


CTGGTTTCTT 


CAGTAATGAG 


TTAAATAAAA 


CATTGACACA 


TACAAAAAAA 




-> 


1208 

1209 
1210 


AAAAAAAAAA 

AAAAAAAAAA 


9060 

AAAAAAAAAA 









9080 




1662 (2) INFORMATION FOR SEQ ID NO: 46: 

1663 (i) SEQUENCE CHARACTERISTICS: 
— > 1664 (A) LENGTH: 4832 base pairs 

1665 (B) TYPE: nucleic acid 

1666 (C) STRANDEDNESS: single 

1667 (D) TOPOLOGY: linear 

1668 (ii) MOLECULE TYPE: DNA (genomic) 

1669 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 : 

1670 CTCGAGCTAA AGATATTTTA GAGAAGAATT AACCTTTTGC TTCTCCAGTT 

— > 1671 GAACATTTGT 60 

1672 AGCAATAAGT CATGCAAATA GAGCTCTCCA CCTGCTTCTT TCTGTGCCTT 

— > 1673 TTGCGATTCT 120 

1674 GCTTTAGTGC CACCAGAAGA TACTACCTGG GTGCAGTGGA ACTGTCATGG 

— > 1675 GACTATATGC 180 

1676 AAAGTGATCT CGGTGAGCTG CCTGTGGACG CAAGATTTCC TCCTAGAGTG 

— > 1677 CCAAAATCTT 240 

1678 TTCCATTCAA CACCTCAGTC GTGTACAAAA AGACTCTGTT TGTAGAATTC 




ASK 
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— > 


1679 


ACGGATCACC 


300 










1680 


TTTTCAACAT 


CGCTAAGCCA 


AGGCCACCCT 


GGATGGGTCT 


GCTAGGTCCT 


— > 


1681 


ACCATCCAGG 


360 










1682 


CTGAGGTTTA 


TGATACAGTG 


GTCATTACAC 


TTAAGAACAT 


GGCTTCCCAT 


— > 


1683 


CCTGTCAGTC 


420 










1684 


TTCATGCTGT 


TGGTGTATCC 


TACTGGAAAG 


CTTCTGAGGG 


AGCTGAATAT 


— > 


1685 


GATGATCAGA 


480 










1686 


CCAGTCAAAG 


GGAGAAAGAA 


GATGATAAAG 


TCTTCCCTGG 


TGGAAGCCAT 


— > 


1687 


ACATATGTCT 


540 










1688 


GGCAGGTCCT 


GAAAGAGAAT 


GGTCCAATGG 


CCTCTGACCC 


ACTGTGCCTT 


— > 


1689 


ACCTACTCAT 


600 










1690 


ATCTTTCTCA 


TGTGGACCTG 


GTAAAAGACT 


TGAATTCAGG 


CCTCATTGGA 


— > 


1691 


GCCCTACTAG 


660 










1692 


TATGTAGAGA 


AGGGAGTCTG 


GCCAAGGAAA 


AGACACAGAC 


CTTGCACAAA 


— > 


1693 


TTTATACTAC 


720 










1694 


TTTTTGCTGT 


ATTTGATGAA 


GGGAAAAGTT 


GGCACTCAGA 


AACAAAGAAC 


— > 


1695 


TCCTTGATGC 


780 










1696 


AGGATAGGGA 


TGCTGCATCT 


GCTCGGGCCT 


GGCCTAAAAT 


GCACACAGTC 


— > 


1697 


AATGGTTATG 


840 










1698 


TAAACAGGTC 


TCTGCCAGGT 


CTGATTGGAT 


GCCACAGGAA 


ATCAGTCTAT 


— > 


1699 


TGGCATGTGA 


900 










1700 


TTGGAATGGG 


CACCACTCCT 


GAAGTGCACT 


CAATATTCCT 


CGAAGGTCAC 


— > 


1701 


ACATTTCTTG 


960 










1702 


TGAGGAACCA 


TCGCCAGGCG 


TCCTTGGAAA 


TCTCGCCAAT 


AACTTTCCTT 


— > 


1703 


ACTGCTCAAA 


1020 










1704 


CACTCTTGAT 


GGACCTTGGA 


CAGTTTCTAC 


TGTTTTGTCA 


TATCTCTTCC 


— > 


1705 


CACCAACATG 


1080 










1706 


ATGGCATGGA 


AGCTTATGTC 


AAAGTAGACA 


GCTGTCCAGA 


GGAACCCCAA 


— > 


1707 


CTACGAATGA 


1140 










1708 


AAAATAATGA 


AGAAGCGGAA 


GACTATGATG 


ATGATCTTAC 


TGATTCTGAA 


— > 


1709 


ATGGATGTGG 


1200 










1710 


TCAGGTTTGA 


TGATGACAAC 


TCTCCTTCCT 


TTATCCAAAT 


TCGCTCAGTT 


— > 


1711 


GCCAAGAAGC 


1260 










1712 


ATCCTAAAAC 


TTGGGTACAT 


TACATTGCTG 


CTGAAGAGGA 


GGACTGGGAC 


— > 


1713 


TATGCTCCCT 


1320 










1714 


TAGTCCTCGC 


CCCCGATGAC 


AGAAGTTATA 


AAAGTCAATA 


TTTGAACAAT 


— > 


1715 


GGCCCTCAGC 


1380 










1716 


GGATTGGTAG 


GAAGTACAAA 


AAAGTCCGAT 


TTATGGCATA 


CACAGATGAA 


— > 


1717 


ACCTTTAAGA 


1440 










1718 


CTCGTGAAGC 


TATTCAGCAT 


GAATCAGGAA 


TCTTGGGACC 


TTTACTTTAT 


— > 


1719 


GGGGAAGTTG 


1500 










1720 


GAGACACACT 


GTTGATTATA 


TTTAAGAATC 


AAGCAAGCAG 


ACCATATAAC 


— > 


1721 


ATCTACCCTC 


1560 










1722 


ACGGAATCAC 


TGATGTCCGT 


CCTTTGTATT 


CAAGGAGATT 


ACCAAAAGGT 


— > 


1723 


GTAAAACATT 


1620 










1724 


TGAAGGATTT 


TCCAATTCTG 


CCAGGAGAAA 


TATTCAAATA 


TAAATGGACA 


— > 


1725 


GTGACTGTAG 


1680 










1726 


AAGATGGGCC 


AACTAAATCA 


GATCCTCGGT 


GCCTGACCCG 


CTATTACTCT 


— > 


1727 


AGTTTCGTTA 


1740 










1728 


ATATGGAGAG 


AGATCTAGCT 


TCAGGACTCA 


TTGGCCCTCT 


CCTCATCTGC 


— > 


1729 


TACAAAGAAT 


1800 










1730 


CTGTAGATCA 


AAGAGGAAAC 


CAGATAATGT 


CAGACAAGAG 


GAATGTCATC 


— > 


1731 


CTGTTTTCTG 


1860 










# 
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INPUT SET: S23358.raw 

17 32 TATTTGATGA GAACCGAAGC TGGTACCTCA CAGAGAATAT ACAACGCTTT 

— > 1733 CTCCCCAATC 19 20 

17 34 CAGCTGGAGT GCAGCTTGAG GATCCAGAGT TCCAAGCCTC CAACATCATG 

- - > 173 5 CACAGCATCA 1980 jL S 



— > 



1736 
1737 
1738 
1739 
1740 
1741 
1742 



ATGGCTATGT TTTTGATAGT 
GCATACTGGT 2040 



TTGCAGTTGT CAGTTTGTTT GCATGAGGTG 




— > 



ACATTCTAAG CATTGGAGCA 
GGATATACCT 2100 



CAGACTGACT TCCTTTCTGT CTTCTTCTCT 



TCAAACACAA AATGGTCTAT 
GGAGAAACTG 2160 



GAAGACACAC TCACCCTATT CCCATTCTCA 



— > 



TCTTCATGTC GATGGAAAAC 



CCA 
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